Search CORE

1,633 research outputs found

“Knowledge Utilization” Universities: A Paradigm for Applying Academic Expertise to Social and Environmental Problems

Author: Bernstein Abraham B.
Publication venue: ScholarWorks at UMass Boston
Publication date: 01/04/1994
Field of study

Promoting the utilization of knowledge is an important mission for institutions of higher education. The knowledge utilization university should stand alongside the research university, the professional school, the liberal arts college, and the community college as one of the five archetypes of higher education institutions. Environmental problems typify a class of social problems that require the utilization of existing knowledge in a trans-disciplinary manner just as much as they require the creation of new knowledge through research. These problems are characterized by their multiple dimensions-- they have scientific, technical, social, political, economic, and ethical aspects, all of which must be taken into account in an integrated way when seeking solutions. Knowledge utilization universities can contribute to solving environmental problems in three ways: by offering interdisciplinary degree programs that address environmental issues; by offering on and off-campus programs that promote environmental literacy; and by involving their faculties in technical assistance and consulting activities with community organizations. To accomplish this, these institutions must develop organizational cultures that support the mission of promoting knowledge utilization. Cultures that were developed to encourage research or teaching are not likely to be effective in encouraging the activities that promote knowledge utilization

University of Massachusetts Boston: ScholarWorks at UMass

Intelligent Assistance for the Data Mining Process: An Ontology-based Approach

Author: Bernstein Abraham
Hill Shawndra
Provost Foster
Publication venue: Stern School of Business, New York University
Publication date: 01/01/2002
Field of study

A data mining (DM) process involves multiple stages. A simple, but typical, process might include preprocessing data, applying a data-mining algorithm, and postprocessing the mining results. There are many possible choices for each stage, and only some combinations are valid. Because of the large space and non-trivial interactions, both novices and data-mining specialists need assistance in composing and selecting DM processes. We present the concept of Intelligent Discovery Assistants (IDAs), which provide users with (i) systematic enumerations of valid DM processes, in order that important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid processes by different criteria, to facilitate the choice of DM processes to execute. We use a prototype to show that an IDA can indeed provide useful enumerations and effective rankings. We discuss how an IDA is an important tool for knowledge sharing among a team of data miners. Finally, we illustrate all the claims with a comprehensive demonstration using a more involved process and data from the 1998 KDDCUP competition.Information Systems Working Papers Serie

New York University Faculty Digital Archive

Towards Intelligent Assistance for a Data Mining Process:-

Author: Bernstein Abraham
Hill Shawndra
Provost Foster
Publication venue: IEEE Computer Society
Publication date: 01/04/2005
Field of study

A data mining (DM) process involves multiple stages. A simple, but typical, process might include preprocessing data, applying a data-mining algorithm, and postprocessing the mining results. There are many possible choices for each stage, and only some combinations are valid. Because of the large space and non-trivial interactions, both novices and data-mining specialists need assistance in composing and selecting DM processes. Extending notions developed for statistical expert systems we present a prototype Intelligent Discovery Assistant (IDA), which provides users with (i) systematic enumerations of valid DM processes, in order that important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid processes by different criteria, to facilitate the choice of DM processes to execute. We use the prototype to show that an IDA can indeed provide useful enumerations and effective rankings in the context of simple classification processes. We discuss how an IDA could be an important tool for knowledge sharing among a team of data miners. Finally, we illustrate the claims with a comprehensive demonstration of cost-sensitive classification using a more involved process and data from the 1998 KDDCUP competition.NYU, Stern School of Business, IOMS Department, Center for Digital Economy Researc

New York University Faculty Digital Archive

Intelligent Assistance for the Data Mining Process: An Ontology-based Approach

Author: Bernstein Abraham
Hill Shawndra
Provost Foster
Publication venue: Stern School of Business, New York University
Publication date: 01/01/2002
Field of study

New York University Faculty Digital Archive

07491 Abstracts Collection -- Mining Programs and Processes

Author: Bernstein Abraham
Gall Harald
Zeller Andreas
Publication venue: Dagstuhl Seminar Proceedings. 07491 - Mining Programs and Processes
Publication date: 01/01/2009
Field of study

From 02.12. to 17.12.2007, the Dagstuhl Seminar 07491 ``Mining Programs and Processes\u27\u27 was held in Schloss Dagstuhl~--~Leibniz Center for Informatics. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available

Dagstuhl Research Online Publication Server

Data Analytics on Online Labor Markets: Opportunities and Challenges

Author: Bernstein Abraham
Feldman Michael
Juldaschewa Frida
Publication venue
Publication date: 06/07/2017
Field of study

The data-driven economy has led to a significant shortage of data scientists. To address this shortage, this study explores the prospects of outsourcing data analysis tasks to freelancers available on online labor markets (OLMs) by identifying the essential factors for this endeavor. Specifically, we explore the skills required from freelancers, collect information about the skills present on major OLMs, and identify the main hurdles for out-/crowd-sourcing data analysis. Adopting a sequential mixed-method approach, we interviewed 20 data scientists and subsequently surveyed 80 respondents from OLMs. Besides confirming the need for expected skills such as technical/mathematical capabilities, it also identifies less known ones such as domain understanding, an eye for aesthetic data visualization, good communication skills, and a natural understanding of the possibilities/limitations of data analysis in general. Finally, it elucidates obstacles for crowdsourcing like the communication overhead, knowledge gaps, quality assurance, and data confidentiality, which need to be mitigated

ZORA

The Relational Vector-space Model

Author: Bernstein Abraham
Clearwater Scott
Provost Foster
Publication venue: Stern School of Business, New York University
Publication date: 01/01/2003
Field of study

This paper addresses the classification of linked entities. We introduce a relational vector (VS) model (in analogy to the VS model used in information retrieval) that abstracts the linked structure, representing entities by vectors of weights. Given labeled data as background knowledge training data, classification procedures can be defined for this model, including a straightforward, "direct" model using weighted adjacency vectors. Using a large set of tasks from the domain of company affiliation identification, we demonstrate that such classification procedures can be effective. We then examine the method in more detail, showing that as expected the classification performance correlates with the- relational auto correlation of the data set. We then turn the tables and use the relational VS scores as a way to analyze/visualize the relational autocorrelation present in a complex linked structure. The main contribution of the paper 1s to introduce the relational VS model as a potentially useful addition to the toolkit for relational data mining. It could provide useful constructed features for domains with low to moderate relational autocorrelation; it may be effective by itself for domains with high levels of relational autocorrelation, and it provides a useful abstraction for analyzing the properties of linked data.Information Systems Working Papers Serie

New York University Faculty Digital Archive

A Multi-Stream Approach for Video Understanding

Author: Bernstein Abraham
Kunam Lutharsanen
Rossetto Luca
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 14/11/2022
Field of study

The automatic annotation of higher-level semantic information in long-form video content is still a challenging task. The Deep Video Understanding (DVU) Challenge aims at catalyzing progress in this area by offering common data and tasks. In this paper, we present our contribution to the 3rd DVU challenge. Our approach consists of multiple information streams extracted from both the visual and the audio modality. The streams can build on information generated by previous streams to increase their semantic descriptiveness. Finally, the output of all streams can be aggregated in order to produce a graph representation of the input movie to represent the semantic relationships between the relevant characters

ZORA